PAGE: 1 RAW SEQUENCE LISTING DATE: 02/10/2000 

PATENT APPLICATION US/09/286,166 TIME: 16:29:54 

INPUT SET: S34717.raw 



This Raw Listing contains the General 
Information Section and up to the first 5 pages. 



1 SEQUENCE LISTING 
2 

3 ( 1 ) General Information: 
4 

5 (i) APPLICANT: FOWLKES, Dana M. . ■ . 

6 BROACH, Jim C INI I F P 

7 MANFREDI , John ^ 1 

8 KLEIN, Christine 

9 MURPHY , Andrew J. 

10 PAUL, Jeremy 

11 TRUEHEART, Joshua 
12 

13 (ii) TITLE OF INVENTION: YEAST CELLS ENGINEERED TO PRODUCE 

14 PHERMONE SYSTEM PROTEIN SURROGATES, AND USES THEREFOR 
15 

16 (iii) NUMBER OF SEQUENCES: 119 
17 

18 (iv) CORRESPONDENCE ADDRESS: 

19 (A) ADDRESSEE: LAHIVE AND COCKFIELD 

20 (B) STREET: 60 State Street, Suite 1 510 

21 (C) CITY: Boston 

22 (D) STATE: MA 

2 3 (E) COUNTRY: USA 

24 (F) ZIP: 02109 
25 

26 (V) COMPUTER READABLE FORM: 

27 (A) MEDIUM TYPE: Floppy disk 

28 (B) COMPUTER: IBM PC compatible 

29 (C) OPERATING SYSTEM: PC-DOS/MS-DOS 

30 (D) SOFTWARE: Patentln Release #1.0, Version #1.25 
31 

32 (vi) CURRENT APPLICATION DATA: 

33 (A) APPLICATION NUMBER: 09/286,166 

34 (B) FILING DATE: 

35 (C) CLASSIFICATION: 
36 

37 (vii) PRIOR APPLICATION DATA: 

38 (A) APPLICATION NUMBER: US 08/461,383 

39 (B) FILING DATE: 05-JUN-1995 
40 

41 (A) APPLICATION NUMBER: US 08/322,137 

42 (B) FILING DATE: 13-0CT-1994 
43 

44 (vii) PRIOR APPLICATION DATA: 

45 (A) APPLICATION NUMBER: US 08/309,313 

46 (B) FILING DATE: 20-SEP-1994 
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RAW SEQUENCE LISTING DATE: 02/10/2000 

PATENT APPLICATION US/09/286,166 TIME: 16:29:54 

INPUT SET: S34717.raw 

(vii) PRIOR APPLICATION DATA: 

(A) APPLICATION NUMBER: US 08/190,328 

(B) FILING DATE: 31-JAN-1994 

(vii) PRIOR APPLICATION DATA: 

(A) APPLICATION NUMBER: US 08/041,431 

(B) FILING DATE: 31-MAR-1993 

(viii) ATTORNEY/AGENT INFORMATION: 

(A) NAME: Vincent, Matthew P 

(B) REGISTRATION NUMBER: 36,709 

(C) REFERENCE/ DOCKET NUMBER: CPI-012CP4B 



(ix) TELECOMMUNICATION INFORMATION: 

(A) TELEPHONE: 617-227-7400 

(B) TEEFAX: 617-227-5941 

(C) TELEX: 752806 

(2) INFORMATION FOR SEQ ID NO:l: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 89 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:l: 

Met Arg Phe Pro Ser lie Phe Thr Ala Val Leu Phe Ala Ala Ser Ser 
1 5 10 15 

Ala Leu Ala Ala Pro Val Asn Thr Thr Thr Glu Asp Glu Thr Ala Gin 
20 25 30 

lie Pro Ala Glu Ala Val lie Gly Tyr Leu Asp Leu Glu Gly Asp Phe 
35 40 45 

Asp Val Ala Val Leu Pro Phe Ser Asn Ser Thr Asn Asn Gly Leu Leu 
50 55 60 

Phe lie Asn Thr Thr lie Ala Ser lie Ala Ala Lys Glu Glu Gly Val 
65 70 75 80 

Ser Leu Asp Lys Arg Glu Ala Glu Ala 
85 

(2) INFORMATION FOR SEQ ID NO : 2 : 



PAGE: 3 



100 
101 
102 
103 
104 
105 
106 
107 
108 
109 
110 
111 
112 
113 
114 
115 
116 
117 
118 
119 
120 
121 
122 
123 
124 
125 
126 
127 
128 
129 
130 
131 
132 
133 
134 
135 
136 
137 
138 
139 
140 
141 
142 
143 
144 
145 
146 
147 
148 
149 
150 
151 
152 



RAW SEQUENCE LISTING DATE: 02/10/2000 

PATENT APPLICATION US/09/286,166 TIME: 16:29:54 

INPUT SET: S34717.raw 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 76 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2: 

Trp His Trp Leu Gin Leu Lys Pro Gly Gin Pro Met Tyr Lys Arg Glu 
15 10 15 

Ala Glu Ala Glu Ala Trp His Trp Leu Gin Leu Lys Pro Gly Gin Pro 
20 25 30 



Met Tyr Lys Arg Glu Ala Asp Ala Glu Ala Trp His Trp Leu Gin Leu 
35 40 45 

Lys Pro Gly Gin Pro Met Tyr Lys Arg Glu Ala Asp Ala Glu Ala Trp 
50 55 60 

His Trp Leu Gin Leu Lys Pro Gly Gin Pro Met Tyr 
65 70 75 

(2) INFORMATION FOR SEQ ID NO : 3 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 15 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: synthetic DNA 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO : 3 : 
AAGCTTAAAA GAATG 15 
(2) INFORMATION FOR SEQ ID NO: 4: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 37 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 
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RAW SEQUENCE LISTING DATE: 02/10/2000 

PATENT APPLICATION US/09/286,166 TIME: 16:29:55 

INPUT SET: S34717.raw 

(ix) FEATURE: 

(A) NAME/ KEY : CDS 

(B) LOCATION: 1..24 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 4: 



AAA GAA GAA GGG GTA TCT TTG CTT AAGCTCGAGA TCT 37 
Lys Glu Glu Gly Val Ser Leu Leu 
1 5 



(2) INFORMATION FOR SEQ ID NO: 5: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 8 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 5: 

Lys Glu Glu Gly Val Ser Leu Leu 
1 5 

(2) INFORMATION FOR SEQ ID NO: 6: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 77 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: synthetic DNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 6: 
CGTGAAGCTT AAGCGTGAGG CAGAAGCTNN KNNKNNKNNK NNKNNKNNKN NKNNKNNKNN 60 
KNNKNNKTGA TCATCCG 77 
(2) INFORMATION FOR SEQ ID NO : 7 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 19 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 
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PATENT APPLICATION US/09/286,166 TIME: 16:29:55 

INPUT SET: S34717. raw 

206 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 7: 

207 

208 Lys Arg Glu Ala Glu Ala Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa 

209 15 10 15 
210 

211 Xaa Xaa Xaa 

212 

213 

214 

215 (2) INFORMATION FOR SEQ ID NO : 8 : 
216 

217 (i) SEQUENCE CHARACTERISTICS: 

218 (A) LENGTH: 36 amino acids 

219 (B) TYPE: amino acid 

220 (D) TOPOLOGY: linear 
221 

222 (ii) MOLECULE TYPE: peptide 

223 

224 

225 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 8: 

226 

227 Met Gin Pro Ser Thr Ala Thr Ala Ala Pro Lys Glu Lys Thr Ser Ser 

228 1 5 10 15 
229 

230 Glu Lys Lys Asp Asn Tyr lie lie Lys Gly Val Phe Trp Asp Pro Ala 

231 20 25 30 
232 

233 Cys Val He Ala 

234 35 
235 

236 (2) INFORMATION FOR SEQ ID NO : 9 : 
237 

238 (i) SEQUENCE CHARACTERISTICS: 

239 (A) LENGTH: 19 base pairs 

240 (B) TYPE: nucleic acid 

241 (C) STRANDEDNESS : single 

242 (D) TOPOLOGY: linear 
243 

244 (ii) MOLECULE TYPE: synthetic DNA 

245 

246 

247 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 9: 

248 

24 9 AAGCTTTCGA ATAGAAATG 19 
250 

251 (2) INFORMATION FOR SEQ ID NO: 10: 
252 

25 3 (i) SEQUENCE CHARACTERISTICS: 

254 (A) LENGTH: 36 base pairs 

255 (B) TYPE: nucleic acid 

256 (C) STRANDEDNESS: double 

257 (D) TOPOLOGY: linear 
258 
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SEQUENCE VERIFICATION REPORT 

PATENT APPLICATION US/09/286,166 



DATE: 02/10/2000 
TIME: 16:29:55 



INPUT SET: S3471 7. raw 



Line Error Original Text 



